The stability of different aggregation techniques in ensemble feature selection

نویسندگان

چکیده

Abstract To mitigate the curse of dimensionality in high-dimensional datasets, feature selection has become a crucial step most data mining applications. However, no method consistently delivers best performance across different domains. For this reason and order to improve stability process, ensemble frameworks have increasingly popular. While many examined construction techniques under various considerations, little work been done shed light on influence aggregation process selection. In contribution field, aims explore impact some selected strategies ensemble’s accuracy. Using twelve classification real datasets from domains, accuracy five were four standard filter methods. The experimental analysis revealed significant differences both behavior aggregations, especially between score-based rank-based strategies. Moreover, it was observed that simpler based Arithmetic Mean or L2-norm appear be efficient compelling cases. Given structure associated application domain, work’s findings can guide ensembles using suitable rules.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Feature Selection Using Ensemble Feature Selection Techniques

Robustness or stability of feature selection techniques is a topic of recent interest, and is an important issue when selected feature subsets are subsequently analysed by domain experts to gain more insight into the problem modelled. In this work, we investigate the use of ensemble feature selection techniques, where multiple feature selection methods are combined to yield more robust results....

متن کامل

A Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation

In credit risk evaluation the accuracy of a classifier is very significant for classifying the high-risk loan applicants correctly. Feature selection is one way of improving the accuracy of a classifier. It provides the classifier with important and relevant features for model development. This study uses the ensemble of multiple feature ranking techniques for feature selection of credit data. ...

متن کامل

Classification Performance of Rank Aggregation Techniques for Ensemble Gene Selection

A very promising tool for data mining and bioinformatics is ensemble gene (feature) selection. Ensemble feature selection is the process of performing multiple runs of feature selection and then aggregating the results into a final ranked list. However, a central question of ensemble feature selection is how to aggregate the individual results into a single ranked feature list. There are a numb...

متن کامل

MLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection

Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Big Data

سال: 2022

ISSN: ['2196-1115']

DOI: https://doi.org/10.1186/s40537-022-00607-1